AITopics | world language

Collaborating Authors

world language

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Neighbors and relatives: How do speech embeddings reflect linguistic connections across the world?

Törö, Tuukka, Suni, Antti, Šimko, Juraj

arXiv.org Artificial IntelligenceJun-11-2025

Investigating linguistic relationships on a global scale requires analyzing diverse features such as syntax, phonology and prosody, which evolve at varying rates influenced by internal diversification, language contact, and sociolinguistic factors. Recent advances in machine learning (ML) offer complementary alternatives to traditional historical and typological approaches. Instead of relying on expert labor in analyzing specific linguistic features, these new methods enable the exploration of linguistic variation through embeddings derived directly from speech, opening new avenues for large-scale, data-driven analyses. This study employs embeddings from the fine-tuned XLS-R self-supervised language identification model voxlingua107-xls-r-300m-wav2vec, to analyze relationships between 106 world languages based on speech recordings. Using linear discriminant analysis (LDA), language embeddings are clustered and compared with genealogical, lexical, and geographical distances. The results demonstrate that embedding-based distances align closely with traditional measures, effectively capturing both global and local typological patterns. Challenges in visualizing relationships, particularly with hierarchical clustering and network-based methods, highlight the dynamic nature of language change. The findings show potential for scalable analyses of language variation based on speech embeddings, providing new perspectives on relationships among languages. By addressing methodological considerations such as corpus size and latent space dimensionality, this approach opens avenues for studying low-resource languages and bridging macro- and micro-level linguistic variation. Future work aims to extend these methods to underrepresented languages and integrate sociolinguistic variation for a more comprehensive understanding of linguistic diversity.

artificial intelligence, correlation, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2506.08564

Country:

Europe (1.00)
Asia (1.00)
Africa (0.68)
North America > United States (0.46)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.34)

Add feedback

New study tests machine learning on detection of borrowed words in world languages

#artificialintelligenceDec-12-2020, 02:04:04 GMT

Lexical borrowing is very widespread and may affect even those words that play an important role in our daily life. English'mountain', for example, was borrowed from Old French, along with many other words. Researchers from the Pontificia Universidad Católica del Perú and the Max Planck Institute for the Science of Human History have investigated the ability of machine learning algorithms to identify lexical borrowings using word lists from a single language. Results published in the journal PLOS ONE show that current machine-learning methods alone are insufficient for borrowing detection, confirming that additional data and expert knowledge are needed to tackle one of historical linguistics' most pressing challenges. Lexical borrowing, or the direct transfer of words from one language to another, has interested scholars for millennia, as evidenced in Plato's Kratylos dialog, in which Socrates discusses the challenge imposed by borrowed words on etymological studies.

detection, new study test machine, world language, (1 more...)

#artificialintelligence

Country: South America > Peru (0.29)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

New study tests machine learning on detection of borrowed words in world languages

#artificialintelligenceDec-10-2020, 01:08:22 GMT

IMAGE: Lexical borrowing is very widespread and may affect even those words that play an important role in our daily life. English'mountain', for example, was borrowed from Old French, along... view more Lexical borrowing, or the direct transfer of words from one language to another, has interested scholars for millennia, as evidenced already in Plato's Kratylos dialogue, in which Socrates discusses the challenge imposed by borrowed words on etymological studies. In historical linguistics, lexical borrowings help researchers trace the evolution of modern languages and indicate cultural contact between distinct linguistic groups - whether recent or ancient. However, the techniques for identifying borrowed words have resisted formalization, demanding that researchers rely on a variety of proxy information and the comparison of multiple languages. "The automated detection of lexical borrowings is still one of the most difficult tasks we face in computational historical linguistics," says Johann-Mattis List, who led the study. In the current study, researchers from PUCP and MPI-SHH employed different machine learning techniques to train language models that mimic the way in which linguists identify borrowings when considering only the evidence provided by a single language: if sounds or the ways in which sounds combine to form words are atypical when comparing them with other words in the same language, this often hints to recent borrowings.

detection, historical linguistics, new study test machine, (6 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Amid coronavirus, students flock to Kahoot! and Duolingo. Is it the end of language teachers?

USATODAY - Tech Top StoriesApr-7-2020, 16:47:46 GMT

Every day, Massachusetts seventh-grader Kaylyn Wilson takes a break from doing homework online and opens an app on her phone for a half-hour foreign language lesson. "The boy has three green bikes and an egg," the 12-year-old announced to her family in French at the start of her third week using the mobile app from Rosetta Stone, the language-learning software giant. Wilson doesn't yet need to study a language for credit. But during the school shutdowns to contain the coronavirus, her father saw Rosetta Stone advertise free accounts for students – an offer other language-learning software companies have made as well. Wilson decided to give it a go.

duolingo, rosetta stone, student, (12 more...)

USATODAY - Tech Top Stories

Country:

North America > United States > Massachusetts (0.25)
North America > Canada > Ontario > Toronto (0.15)
North America > United States > New York (0.05)
(4 more...)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.72)
Health & Medicine > Therapeutic Area > Immunology (0.72)
Education > Curriculum > Subject-Specific Education (0.58)
Education > Educational Setting > K-12 Education > Secondary School (0.36)

Technology:

Information Technology > Communications (0.70)
Information Technology > Artificial Intelligence (0.47)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.41)

Add feedback